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I . Introduction 


This  paper  is  devoted  to  the  application  of  the  bilinear 
regulator  theory,  developed  recently  by  the  authors  [1],  to  a 
two-dimensional  pursuit-evasion  problem.  The  general  pursuit- 
evasion  problem  has  been  studied  for  many  years  resulting  in  a 
variety  of  formulations  and  control  schemes.  Among  these,  Deyst 

[2]  applied  the  theory  of  optimal  stochastic  control  to  derive  a 
sophisticated  nonlinear  controller  which  lead  to  a significant 
improvement  in  performance  in  relation  to  those  resulting  from  the 
conventional  proportional  navigation  guidance  law.  On  the  other 
hand,  the  theory  of  differential  games  was  considered  by  Ho  et  al 

[3]  and  Raj an  [4]  to  determine  optimal  strategies  for  this  problem. 
Recently , Slater [ 5]  and  Nazarof f [6]took  the  perturbation  point  of 
view  in  applying  optimal  control  theory  to  this  two-dimensional 
pursuit-evasion  problem. 

However,  due  to  the  highly  nonlinear  (transcendental  type) 
structure  underlying  the  system,  the  usual  linearization  techniques 
are  not  sufficient  to  render  an  accurate  maneuver  of  the  pursuer 
under  a variety  of  operational  circumstances  [5] . On  the  other 
hand,  the  currently  developed  numerical  approaches  in  solving  this 
terminal  constraint  optimal  control  problem  generally  require  a 
tremendous  amount  of  storage  space  in  addition  to  a significant  com- 
puting time.  Therefore,  a direct  approach  using  a bilinear  control 
formulation  is  considered  here  which  takes  advantage  of  the  merits 
of  a commutative  bilinear  system  [1]  to  analyze  the  nonlinear  opti- 
mization problem  directly.  A closed  form  solution  to  this  problem 
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is  obtained.  It  provides  a great  deal  of  potential  to  be  implemented 
. in  practice  due  to  the  structural  constancy  of  the  controller. 

The  commutative  property  of  such  a bilinear  system  is  introduced 
by  an  additional  assumption,  beyond  the  traditional  control  con- 
figuration, that  the  line  speed  of  the  pursuer  (missile)  can  be 
manipulated  proportional  to  its  turn  rate  which  is  regarded  as  the 
major  control  variable.  The  line  speed  and  turn  rate  of  the  target 
are  assumed  to  be  known  constants.  For  a given  initial  position  of 
the  target  and  relative  heading  with  respect  to  the  missile,  the 
problem  is  to  determine  a proportional  factor  y , a proper  inter- 
cept time  T and  a control  law  (turn  rate  of  the  missile)  which 
steers  the  missile  to  the  target  at  T;  meanwhile,  the  total  con- 
trol energy  spent  during  this  interval  is  minimized.  The  existence 
theorems  for  the  bilinear  regulator  problem  [1]  are  applied  to  show 
the  simple  form  of  the  optimal  control  law  which  is  a constant  and 
can  be  solved  explicitly  from  the  boundary  conditions.  It  is  thus 
shown  that  the  missile  is  globally  controllable  to  the  target  with 
the  control  law  formulated  here  provided  the  turn  rate  of  the  target 
does  not  vanish. 

A singular  perturbation  of  this  system  is  also  considered. 
Singularly  perturbed  optimal  control  problems  have  been  studied  in 
Reference  [7]  where  the  asymptotic  approximation  approach  is  con- 
sidered in  obtaining  asymptotic  solutions  for  either  linear  or  non- 
linear systems.  Here  we  will  derive  the  optimal  control  in  closed  form 
for  this  problem  and  show  that  this  solution  converges  to  that  of  the 
f unperturbed  system  as  the  degree  of  perturbation  e approaches  zero. 
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A least  squares  estimation  scheme  is  developed  to  estimate 
the  target  speed  and  relative  heading  based  on  the  known  target 
and  missile  turn  rates  and  continuous  measurements  of  the  position 
vector.  This  scheme  in  combination  with  the  optimal  control  law 
is  simulated  under  a variety  of  initial  conditions  and  target 
line  speeds  and  turn  rates  to  show  the  effectiveness  of  such  an 
optimal  controller. 

The  problems  of  further  interest  include  the  construction  of 
a feedback  control  law  which  can  guarantee  a terminal  intercept, 
the  sensitivity  analysis  as  well  as  the  stochastic  estimation  of 
the  target  speed  and  turn  rate  which  should  be  taken  into  account 
for  the  implementation  of  a desirable  controller  in  a real  pursuit- 
evasion  system. 


II . Minimum  Energy  Control  of  Commutative  Bilinear  Systems 
Consider  a general  bilinear  system 


x = (A  + l B.  u.  )x, 
i=l  1 1 


x(tQ)  = xQ 


t e [tQ,T] 


where  A,B^ (i=l, . . . ,m)  are  constant  n x n matrices;  u(t)  is 
an  m-vector  with  each  component  u^  being  a square  integrable 
function  defined  on  [tg,T].  The  system  (1)  is  commutative  if 
every  pair  of  the  matrices  (A,B^,B2/ • • • , Bm)  commute  with  each 
other. 

Due  to  its  structural  simplicity,  we  will  focus  our  attention 


on  the  commutative  bilinear  system  and  the  minimum  energv  problem 
associated  with  it  in  this  paper. 
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The  objective  is  to  minimize  the  following  cost  functional 

fT 

J(u)  = u' (t ) Ru (t ) dt  (2) 

Jto 

subject  to  the  constraint 

x(T)  = xx  (3) 

where  R is  an  m x m positive  definite  symmetric  matrix;  and 
is  a prespecified  vector. 

In  order  to  analyze  this  terminal  constraint  problem,  we  shall 
first  study  the  reachable  zone  of  the  system  (1) . We  introduce 
the  following  definitions.  Let  U be  the  set  of  admissible  controls. 

Definition  1: 

A set  Z (Xq ;U)  is  called  a reachable  set  associated  with  (1) 

if  Z(xQ;U)  = {x(T) eRn:x(tQ)  = xQ , u (t) eU , x ( t)  satisfies  (1)  in  [tQ,T]}. 

A set  Z (D;U)  is  called  a reachable  zone  associated  with  (1)  if 

Z(D;U)  = U Z(xQ;U)  . 
x0eD 

Definition  2; 

The  system  (1)  is  reachable  to  x^  with  respect  to  Xq  and 
U if  there  is  an  input  u(t)  e U which  steers  it  from  Xq  to  x.^ 
at  some  finite  time  T.  It  is  constant  reachable  to  x^  with  res- 
pect to  Xq  if  there  is  a constant  input  function  uc  which  steers 
it  from  Xq  to  x^  at  some  finite  time  T. 


5. 


The  next  lemma  reveals  an  inter  esting  property  regarding  the 
reachability  of  commutative  bilinear  systems.  The  proof  is  straight- 
forward and  may  be  found  in  Reference  [1]  or  [8] . 

Lemma  1 : 

The  commutative  bilinear  system  (1)  is  reachable  to  x.^  with 
respect  to  Xq  and  L2 ( [ tg ,T] , Rm)  if  and  only  if  it  is  constant 
reachable  to  x^  with  respect  to  Xq. 

Lemma  1 assures  that  if  u(t)  e L2 ( [tQ,T] ,Rm)  steers  the  com- 
mutative bilinear  system  from  xQ  to  x-^  at  T,  then  there  exists 
a constant  input  function  uc  which  can  do  the  same  job  as  well. 

This  enables  us  to  study  the  given  system  with  a class  of  simple 
easily- implemented  input  functions,  namely,  the  constant  input 
functions.  For  this  class  of  bilinear  systems,  the  reachable  zone 
is  much  easier  to  characterize  because  one  need  only  consider  a 
constant  input  uc  as  a set  of  parameters,  then  compute  the  corres- 
ponding transition  matrices1"  $ (t,tn)  , i = l,...,m,  which 

ci 

characterize  the  reachable  zone  of  the  given  system  (1)  independent 
of  the  initial  conditions.  The  following  theorem  describes  the  ex- 
istence and  the  form  of  optimal  controls  to  the  problem  (1)  - (3). 
Again,  the  interested  reader  is  referred  to  Reference  [1]  or  [8]  for 
the  proof. 


The  transition  matrix  4>  (t,tn)  is  defined  by  4 (t,tft)  = 

ci  ci  u 

BjLU^4>c  (t,tg)  with  <t>c  (tQ,tg^  = I,  the  identity  matrix. 
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Theorem  1 : 

Given  a commutative  bilinear  system  (1)  with  the  cost  (2) . 

If  x.^  belongs  to  the  reachable  set  Z (Xq;L2  ( [tg,T]  ,Rm)  ) , then 

* 

there  exists  a constant  optimal  control  uc  which  steers  (1) 

it 

from  Xq  to  at  T and  minimizes  (2) . Furthermore,  uc 

satisfies 

m 

X1  = $c.  *T,t0^X0' 

1=1  1 

Therefore,  the  infinite  dimensional  optimization  problem  (1)  - 

(3)  has  been  transformed  into  solving  a finite  dimensional  non- 

* 

linear  algebraic  equation  (4)  for  uc  . It  will  be  shown  in  the 
next  section  that  in  the  pursuit-evasion  problem  we  can  easily  solve 
for  the  optimal  control  in  analytic  form.  This  is  certainly  due  to 
the  structural  merit  of  a commutative  bilinear  system. 
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111 • A Two-Dimensional  Missile  Intercept  System 

For  a typical  high-speed  pursuing  missile  and  short  initial 
range,  it  is  assumed  that  the  maneuvering  of  the  vehicles  can  be 
restricted  to  a two-dimensional  plane  as  shown  in  Figure  1 with 
the  coordinates  fixed  in  the  missile. 


Fig.  1 Dynamics  of  a Two-Dimensional  Pursuit  System 


Kinematic  Equations  of  Motion  [5] . 

Denote  the  angular  rate  of  the  missile  and  the  target  with 
respect  to  a non-rotating  reference  frame  as  up  and  uT,  respect- 
ively. Then  the  equations  of  motion  are  described  by: 
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*1  = _VT  Sin  X3  + X2Up 


x_  = vm  cos  x,  - X,U  - V 
2 T 3 1 p p 


x3  = “T  ' u 


(5) 


where  vT  and  are  the  line  speeds  of  the  target  and  the 

missile  relative  to  air,  x^  and  x2  are  the  horizontal  and  ver- 
tical distance  from  the  missile,  and  x^  is  the  relative  angle 
between  the  headings  of  the  missile  and  target  measured  counter- 
clockwise. 

The  system  (5)  can  be  transformed  into  a homogeneous  bilinear 
system  by  introducing  the  auxiliary  states:  x^  = sin  x^,  x^  = cos  x 

and  Xg  = 1.  That  is, 


x = Ax  + Bxu 


(6) 


with 
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in  which  u = u is  defined  as  the  control  variable. 
P 
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The  problem  is  to  find  a square  integrable  function  u (t) 
which  steers  the  missile  to  the  target  at  some  finite  time  T 
(a  free  time  formulation)  while  the  total  control  energy  consumed 
over  this  interval  [0,T]  is  minimized.  Hence,  the  following 
cost  functional  is  considered: 


-C 


(t)  dt  , T > t. 


subject  to 


xx(T)  = x2(T)  = 0. 


Distinct  from  the  classical  proportional  navigation  guidance 
law,  a crucial  assumption  is  proposed  relative  to  the  line  speed 
Vp  of  the  missile  which  leads  to  an  explicit  solution  to  the  afore- 
mentioned problem.  That  is,  Vp  is  modeled  to  be  proportional  to 
(an  amplification  of)  u with  the  proportional  parameter  y to 

r 

be  determined  by  the  boundary  conditions.  Hence,  we  assume 


v = yu 
P ’ P 


With  this  condition.  Equation  (7)  becomes 


r 
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In  the  event  that  vT  and  uT  are  constants.  Equations  (6) 
and  (11)  can  be  integrated  explicitly  as  follows: 


x6(t)  =1  , x5(t)  = cos  x3(t)  , x4 (t)  = sin  x3(t) 


x3(t)  = x3(tQ)  + uT(t-tQ) 


-c  • 

^ A 


(12) 


(s)  ds 


and 


(t) 

x1 ( tQ ) cos 

u(s)ds  + x_(t0)sin 

u(s)ds 

fc0 

ft  | 

rt 

x2  (t) 

-x^t^sin 

u(s)ds  + x2(tQ)cos 

fc0 

u(s)ds 

t0 

(13) 


rt 

ft 

-sin[x3(tQ)  - 

uds  + u_  ( ^ — t « ) ] 

cos 

uds  -1 

' - vTf 

■fc  U 

c0 

d5  + Y 

t0 

TJ  t 

U A 

ft 

ft 

0 

„ cos[x3(tQ)  - 

uds  + u _(?-tQ) ] 

-sin 

uds 

t0 

fc0 

L- 


We  will  first  resolve  the  terminal  constraint  problem  by  con- 
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sidering  the  intercept  angle  as  a parameter,  then  incorporate  the 
solutions  with  the  minimum  energy  problem.  Considerations  should 
be  given  to  two  separate  cases  in  which  uT  is  zero  and  non-zero, 
respectively. 


Non-zero  Angular  Maneuver  of  the  Target  (uT^0) 

The  terminal  constraint  (9)  on  x^  and  x2  requires (for  some 
T > t0): 


[xx  (t0)+y]cos  [uT(T-tQ)+x3  (tQ ) — 3 ] + x2  (tQ)  sinli^  (T-tQ)+x3  (tQ) -& ] -y' 
[x1(t0)+y]sin[uT(T-t0)+x3(t0)-6]  + x2 (tQ) cos [uT (T-tQ) +x3 (tQ) -8] 


cos  8 - cos [uT (T-tg) -8] 


[sin [uT (T-tQ) -6]  + sin 


where  8 = x3(tQ) 
angle . 


-n- 

^ ft 


(s)ds  + uT(T-tQ)  is  defined  as  the  intercept 


Therefore,  the  terminal  constraint  problem  has  been  reduced  to 
solving  a pair  of  nonlinear  algebraic  equations  (14)  for  an  appro- 
priate set  (y,8,T).  A solution  often  exists  for  this  case  in  which 
the  number  of  unknowns  exceeds  the  number  of  ^quations . From  Equation 
(14),  we  obtain  [8] 

UT  1 ( 

cosB  = cosx3(tQ)  - — x^tg)  + -jxj^  (tQ)cosx3  (tQ)+x2  (tQ)  sinx3  (tQ) 

- ^rIxi(to)+x2<to"}-  <15) 


12. 


f 


From  Equation  (15)  , a proper  y and  3 can  be  solved  in  terms 
of  the  initial  conditions  (tQ)  ,x2  (tQ)  ,x3  (tQ)  ,uT  and  vT.  After 
Y and  3 are  solved,  the  intercept  time  T can  be  computed  by 
the  following  formula: 

T = tQ  + J-[2kTT  + tan-1  |]  , (16) 

where 

F = y[y+x1 (tQ) ]sin[6-x3(tQ) ] + y x2(tQ)  cos [ 3-x3  (tQ) ] 

VT 

- — {ysin  6- [y+x3 (tQ) ] sin  x3(tQ)  + x2(tQ)  cos  x3(tQ)} 

G = y [y+x^^  (tQ)  ] cos  [g-x3  (tQ)  ] -y  x2(tQ)  sin  [ 3-x3  (tQ)  ] 

vt  VT  , 

- — (ycos  B+ty+Xj^ftQ)  ]cos  x3(tQ)  + x2(tQ)  sin  x3(tQ)  - — ) 

k = 0,±1,...  such  that  T > t^. 

These  results  are  summarized  ir.  the  next  lemma. 

Lemma  2 : 

If  a constant  but  non-zero  angular  maneuver  of  the  target  is 
assumed,  then  there  exists  a triple  (y,0,T)  satisfying  (15)  - (16) 
which  solves  the  terminal  constraint  problem  (6)  and  (9)  for  every 
(x3 (tQ) ,x2 (tQ) ,x3 (tQ) ) £ R3.  The  corresponding  proper  control  action 


satisfies 


II  - 


(s)ds  - x3(tQ)  - 6 + uT(T-t0). 


Zero  Angular  Maneuver  of  the  Target  (uT=0) 

In  a similar  manner,  the  terminal  constraint  becomes 


[y+x^tQ)  ]cos  [8-x3  (tQ)  ]-x2  (tQ)  sin[B-x3  (tQ)  ] -vT (T-tQ) sinB  - y' 


[y+x1 (tQ) ]sin[6-x3 (tQ) ]+x2 (tQ) cos [3-x3 (tQ) ]+vT (T-tQ) cos3 


where  3 is  as  defined  in  (14)  . This  equation  can  be  reduced  to 


cos  3 = cos  x 3 ( t q ) + - [x1(tQ)cos  x3(tQ)  + x2(tQ)sin  x3(tQ)].  (19) 


Finally,  the  intercept  time  T is  determined  by 


T = tQ  + — { [y+x.,  (tn)  ]sin  x7(tn)  - x0(tn)cos  x,(tn)  - y sin  3>.  (20) 


Ll'u0; 


3'  0 2 0 


3 0' 


A feasible  T requires  T > , i.e.,  the  term  in  the  bracket  must 
be  positive.  This  is  true  only  for  those  initial  conditions  outside 
the  region  [8]  E defined  by 


E = {(0,y,z)  e R : either  y > 0,  z = (2k+l)Tr,  or  y < 0,z  = 2kir;  k = 0,±1,. 


These  results  are  summarized  in  the  next  lemma. 


14. 


Lemma  3 ; 

If  a zero  angular  maneuver  of  the  target  is  assumed,  then 
there  exists  a triple  (y,B,T)  which  solves  the  terminal  constraint 
problem  (6)  and  (9)  for  every  (x3 (tQ) ,x2 (tQ) ,x3 (tQ) ) e R \ E. 

It  should  be  noticed  that  in  the  previous  analysis  the  control 
function  u(t)  has  been  eliminated  for  simplicity  of  computation. 
However,  an  admissible  control  which  steers  the  missile  to  the  target 
at  T is  associated  with  the  triple  (y,8,T)  through  Equation  (17). 
Thus  the  set  Uc  of  admissible  controls  is  specified  by: 


f T 

Uc  = jueL2 ( [t0,T] ,R) : j u(s)ds  = x3 (tQ) -6+uT (T-tg) j 

tn 


(22) 


where  8 and  T are  determined  by  Equations  (15)  - (16)  or  (19)  - (20) . 

With  the  set  Uc  of  admissible  controls  furnished  as  in  (22) , 
we  are  ready  to  derive  the  solution  to  the  minimum  energy  problem  (8) . 

The  following  proposition  is  a direct  consequence  of  Theorem  1. 


Proposition  1: 

Given  the  system  (6)  and  (11) , there  exists  an  optimal  control 

* 

u e uc  which  minimizes  the  cost  (8)  subject  to  the  constraint  (9) 
for  each  appropriate  set  of  initial  conditions  (x^ (tp) ,x0 (t^) ,x3 (tp) , 
uT,vT) . This  control  is  given  by 


u (t)  = uT  + 


x 3 (tQ) -6 


T-t, 


(23) 


where  T and  8 are  given  as  discussed  in  Lemmas  2 and  3. 
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IV.  Singularly  Perturbed  Problem 

A more  complex  system  model  is  considered  in  this  section  in 
which  the  missile  turn  rate  is  taken  as  the  output  of  a first-order 
lag.  This  takes  into  account  the  practical  situation  in  which  the 
missile  turn  rate  is  furnished  by  a DC-motor  having  first-order 
actuator  dynamics . That  is, 


e u(t)  = -u(t)  + Uq  e > 0 


(24) 


where  Uq  is  the  real  input.  In  the  limiting  case  where  e approach- 
es zero,  this  consideration  is  generally  known  as  a 'singular  per- 
turbation' problem  [7] . 

The  cost  functional  in  this  case  becomes 


i | u^ (s) ds 


J(u0) 

subject  to  the  constraint 

x^(T)  = x2 (T)  = 0 for  some  T > tg. 

By  defining  z = u,  the  system  equations  can  be  expressed  by: 


(25) 


(26) 


x = Ax  + Bxz 


E > 0 


(27) 


ez  = -z  + u. 


J 
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where  A and  B are  the  same  as  defined  in  (7) . 

Because  it  was  shown  in  the  last  section  that  the  terminal 
constraint  problem  has  a solution  in  which  the  control  action 
satisfies  (17) , or  in  terms  of  the  new  state  z 


rT 

: z(s)ds  = x3(tQ)  - 0 + uT(T-t0), 

tft 


the  terminal  constraint  problem  (26)  also  has  a solution  provided: 


ifT  r‘ 

‘J  Jt  “ 


0 (s) e e ds  dt  = x3 (to)-0+uT (T-tQ)+ez (tQ) le 


-i 


Thus  the  set  Uc  of  admissible  controls  for  the  problem  (25)  - (27) 
is  the  collection  of  inputs  satisfying  (28) . 

The  existence  of  an  optimal  control  u*  e Uc  which  minimizes 
the  cost  (25)  can  be  easily  established  as  shown  in  Theorem  1.  By 
the  Maximum  Principle,  this  optimal  solution  satisfies 


uft(t)  = q (t ) 


p(t)  = -9H/3x  = -(A'+B'z)p 


eq(t)  = -3H/3z  = -x'B'p+q 


, eq (T)  = 0.  (30) 


In  the  above  expressions,  (p,q)  are  costates  corresponding  to  (x,z) 
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and  prime  denotes  the  matrix  transpose  operation.  It  can  be  easily 
checked  that  g^(x'B'p)  = 0, 

hence  x'B'p  = k,  a constant  to  be  determined  by  the  boundary  con- 
ditions . 

Substituting  k into  (30)  , q can  be  solved: 


q(t,e)  = 


r-  , 

-kje  e -lj  = uQ 


(t) 


(31) 


Then  from  Equations  (28)  - (29) , k is  given  by: 


T-t, 


1 

= el 


x3(t0)  *3  + u„,(T-tn)  + ez  (trt)  (e 


T ‘■O' 


(32) 


where 


= e £-1. 5 + j 


VT 


2(tQ-T) 


+ I e 


] 


+ T-t, 


These  results  are  summarized  in  the  next  proposition. 


Proposition  2: 

Given  the  system  (29) , there  exists  an  optimal  control 

* 

Uq  e Uc  which  minimizes  the  cost  (25)  subject  to  the  constraint  (26) 
for  each  appropriate  set  of  initial  conditions.  This  control  is 
given  by  (31)  and  (32)  where  B and  T are  given  as  discussed  in 
Lemmas  2 and  3 . 

It  should  be  noticed  that  the  singular  perturbation  comes  into 
the  problem  (27)  as  e approaches  zero.  This  is  clearly  seen  from 
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the  expression  of  the  optimal  control  (31)  - (32),  i.e. 


lim  k(e) 
e->0 


* 

lim  un (t , e) 
e+0  u 


which  is  exactly  the  same  as  that  in  (23) . Therefore,  for  e suffi- 
ciently small  the  solution  to  the  optimal  control  problem  (25) 
associated  with  a fourth-order  system  (27)  can  be  approximated  ar- 
bitrarily closely  by  the  solution  to  the  problem  (8)  associated  with 
the  third-order  system  (5) . 

In  fact,  not  only  the  reduction  of  system  order  is  shown  here, 
but  also  an  explicit  solution  to  the  optimization  problem  of  the 
quadratic  system^  is  derived.  This  provides  a great  deal  of  poten- 
tial to  implement  such  a control  law  in  practice  because  the  first- 
order  actuator  dynamics  have  been  included.  Actually,  this  result 
can  be  generalized  to  include  any  higher  order  actuator  dynamics  as 
long  as  the  constancy  of  the  control  area  is  sustained. 


T-trt  [x3(t0)  + UT(T_t0)] 


= - lim  k (e) 
£-*0 


P - 4 


x3(t0)-e 
UT  T-t_ 


tQ  < t < T, 


T 


Note  that  (27)  is  no  longer  a bilinear  system  as  defined  in  Section 
II;  instead,  it  is  sometimes  referred  to  as  a quadratic  system. 
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V.  A Least  Squares  Estimation 

Given  continuous  measurements  of  the  position  coordinates 
(x^ (t) ,x2 (t) ) and  the  missile  turn  rate  u(t)  together  with  the 
proportionality  parameter  y,  a least  squares  estimate  of  vT 
and  x^ttg)  based  on  the  knowledge  of  a constant  angular  maneuver 
of  the  target  is  considered  in  this  section.  The  purpose  in  con- 
sideration of  such  a formulation  is  to  provide  an  easily  implemented 
estimator  which  connecting  with  the  previously  obtained  optimal  con- 
troller gives  a feasible  solution  to  the  overall  missile  intercept 
problem. 

Let  the  target  speed  vT  be  modeled  by  a constant  v,  and  let 
the  initial  relative  heading  x3(tQ)  be  denoted  by  "a".  A least 
squares  estimate  of  (a,v)  results  upon  minimizing  the  functional: 


- T 

J(a,v)  = | ^x^t)  + v sin[a+U(t)]  - ux2(t)j2  dt 

+ | [x2 (t)  - v cos[a-KJ(t)]  + u[y+x1(t)]j2  dt 

t fx 


(33) 


where  U(t)  is  defined  in  terms  of  the  known  target  and  missile  turn 
rate  uT  and  u by : 


U(t) 


(uT-u)ds. 


(34) 


A necessary  condition  for  the  minimization  of  J is  that  the  partial 
derivatives  of  J with  respect  to  "a"  and  v vanish: 


- - - 


I . 


(a* , v*) 


(a* , v*) 


Observing  that  J is  quadratic  in  v,  the  best  estimate  v*  can 
be  uniquely  determined  in  terms  of  a* : 

v*  = - |x1  (T)  sin[a*+U  (T)  ] - x^t^sin  a*  + x2(tQ)cos  a* 

- x2 (T) cos [a*+U  (T) ] 

UT  £X1  (t)  cos  [a*+U  (t)  ] + x2  ( t)  sin  [a*+U  (t)  jjtyu  cos  [a*+U  (t)  ] dtj 


Similarly, 


0 = A cos  a*  + B sin  a* 


where 


A = x^Tjcos  U (T)  + x2  (T)  sin  U(T)  - x-^tg) 


| | [yU (t) +uTx1 (t) ] sin  U(t)  - uTx2(t)cos  U (t)  dtj 


B = -x1(T)sin  U (T)  + x2(T)cos  U (T)  - x2(tQ) 


| | [yU(t)+uTx1 (t) ]cos  U(t)  + uTx2(t)sin  U(t)  dtj  . 

t A 


From  Equation  (37)  , we  obtain 
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a*  = 2imr  + tan  ^ ^ 


m = 0 , ±1 , 


(38) 


in  which  m is  chosen  so  that 
valent  to: 


2 2 

rj/3az 


> 0. 


(a* , v*) 


This  is  equi- 


B cos  a*  > 0 , 


(39) 


After  a*  is  solved  from  (38)  - (39),  v*  is  given  by  (36). 


Simulation  Results 

The  least  squares  estimation  scheme  is  combined  with  the 
optimal  control  law  to  form  a step-by-step  control  of  the  missile 
intercept  system  for  selected  initial  conditions  and  target  turn 
rates.  The  initial  target  turn  rates  are  assumed  to  be  0.0  rad/sec 
and  0.15  rad/sec  while  the  line  speed  is  1000  ft/sec.  Two  sets  of 
sinusoidal  fluctuations  are  imposed  upon  the  target  turn  rate  and 
line  speed  to  test  the  sensitivity  of  the  system.  The  analytical 

expressions  of  the  target  turn  rate  and  line  speed  are  listed  in 
Table  1. 


Actual  Target 

Table  1. 
Turn  Rates  and 

line  speeds 

Figure 

No. 

UT 

(rad/sec) 

vT  (ft/sec) 

2 

0.15 

1000 

3 

0.00 

1000 

4 

-0.001+0.001 (cos  0 

•2t+sin  0.2t) 

1000 

5 

0.00 

1000 

I 6 

0.00 

980+20 (cos  0.2t+sin  0.2t) 

A time  constant  e = 0.5  sec  is  assumed  for  the  first-order 

actuator  dynamics.  During  the  first  estimation  interval  of  duration 

— 2 1 

0.5  sec.,  the  control  effort  was  assumed  to  be  u(t)  = 0.06e 
rad/sec. 

The  relative  trajectories  of  the  target  with  respect  to  the 
missile  are  shown  in  Figures  2-6.  It  is  clearly  seen  that  in  the 
case  where  either  the  target  turn  rate  or  its  line  speed  is  a known 
constant  (Figures  2,  3 and  5),  the  estimation  of  the  target  speed 
and  the  initial  relative  heading  is  exact,  and  an  ideal  intercept  is 
as  expected.  For  the  case  in  which  an  internal  fluctuation  of  either 
the  target  turn  rate  or  its  line  speed  is  not  known  in  advance 
(Figures  4 and  6) , the  step-by-step  control  action  is  taken  to  offset 
the  estimation  error,  which  prolongs  the  expected  intercept  interval 
and  in  these  cases  results  in  reasonable  convergence  after  a few  loops 
of  estimation. 

Unknown  fluctuations  on  the  target  turn  rate  of  higher  amplitude, 
e.g.  - 0 . 005+0 . 005 (cos  0.2t+sin  0.2t)  were  also  considered,  as  well 
as  a time-varying  sawtooth  target  line  speed,  in  order  to  test  the 
effectiveness  of  the  estimation  scheme.  Simulation  results  are  rela- 
tively poor  in  these  cases  even  though  further  estimation  steps  are 
called  upon  in  trying  to  reduce  the  estimation  error.  This  indicates 
the  high  sensitivity  of  the  estimation  scheme  to  derivations  from 
the  assumed  constant  values  for  the  unknown  parameters.  It  is  sug- 
gested that  a recursive  estimator,  which  utilizes  the  first  estima- 
tion data  to  initiate  a secondary  least  squares  estimation  on  either 
the  target  turn  rate  or  line  speed,  may  be  satisfactory  in  this  ob- 
jective . 


u p ( rad /sec ) 


vp  ( I03ft/sec ) 


2k. 
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VI . Conclusion 

The  application  of  the  bilinear  regulator  theory  to  a two- 
dimensional  pursuit-evasion  problem  was  considered.  An  explicit 
solution  to  the  minimum  energy  control  of  this  problem  was  obtained 
by  introducing  a new  design  criterion  that  the  line  speed  of  the 
pursuer  be  made  proportional  to  its  turn  rate.  Optimal  control 
appeared  in  a very  simple  form,  i.e.  a constant  determined  by  the 
boundary  conditions.  On  the  other  hand,  it  was  shown  that  a sub- 
optimal  control  law  can  always  be  constructed  which  will  drive  the 
missile  to  the  target  at  the  same  intercept  time  as  long  as  the 
area  swept  out  by  this  control  function  satisfies  the  same  constant. 
This  allows  the  control  engineer  a great  deal  of  flexibility  in  the 
design  of  a feasible  easily-implemented  sub-optimal  controller. 

An  extension  of  this  problem  to  the  singularly  perturbed  system 
was  also  studied.  It  was  shown  that  a closed  form  solution  for  this 
higher-order  system  can  also  be  obtained.  This  is  in  contrast  with 
the  asymptotically  approximated  solutions  derived  in  Reference  [7] 
for  general  nonlinear  systems.  A least  squares  estimate  of  the 
target  speed  and  the  relative  heading  was  discussed.  A straight- 
forward solution  was  obtained.  This  estimation  scheme  was  combined 
with  the  optimal  control  law  to  form  a step-Ly-step  closed  loop  con- 
trol law.  The  simulation  results  indicated  good  accuracy  for  an 
intercept  when  the  estimation  errors  are  small.  However,  the  sen- 
sitivity analyses  against  different  fluctuations  on  target  line 
speed  and  turn  rate  reflect  the  necessity  to  incorporate  a recursive 
estimation  scheme  in  correcting  the  fluctuation  effect,  which  is 
yet  to  be  investigated. 
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Further  studies  on  the  estimation  including  the  target  turn 
rate  and  speed  as  stochastic  models,  as  well  as  the  construction 
of  a feedback  control  law,  should  be  considered  for  this  problem. 
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